THE MISSING LOG IN LARGE DEVIATIONS FOR 
TRIANGLE COUNTS 



SOURAV CHATTERJEE 

Abstract. This paper solves the problem of sharp large deviation estimates
for the upper tail of the number of triangles in an Erdős-Rényi
random graph, by establishing a logarithmic factor in the exponent that
was missing till now. It is possible that the method of proof may extend
to general subgraph counts.



1. Introduction 

Let $G(n,p)$ be an Erdős-Rényi graph on $n$ vertices with edge probability $p$,
that is, a random graph on $n$ vertices where each edge is added independently
with probability $p$. Given a fixed small graph $H$, let $X_H$ denote the
number of copies of $H$ in $G(n,p)$. The distribution of $X_H$ has been studied
extensively since the work of Erdős and Rényi [10] in 1960, where the first
results were given. In the very sparse case, where only a few copies of $H$ can
occur, the probability of $\{X_H > 0\}$ was studied by Bollobás [2]. A necessary
and sufficient condition for the asymptotic normality of $X_H$ (when $n \to \infty$
and $p$ remains fixed or $p \to 0$) was obtained by Ruciński [20]. Sharp large
deviation inequalities for $\mathbb{P}(X_H \le (1-\epsilon)\mathbb{E}(X_H))$, where $\epsilon$ is a fixed positive
number, were obtained by Janson, Łuczak and Ruciński [13] via the method
of Janson's inequality [12].

One key question that remained open for a long time was the issue of
sharp large deviations for the 'upper tail', i.e. competent bounds for

\[ \mathbb{P}(X_H \ge (1+\epsilon)\mathbb{E}(X_H)), \]

where $\epsilon$ is a fixed constant. For historical accounts of this problem, one may
look in [3, 13, 15]. The problem was almost completely intractable until the
year 2000, when the first general exponential tail bound was obtained by
Vu [22]. However, Vu's upper bound from [22], as well as the upper bounds
obtained by Janson and Ruciński [15] soon after, were quite far from the
conjectured bounds. In fact, the paper [15] surveys an exhaustive array of
techniques, ranging from the simple Azuma-Hoeffding inequality [11] to the
powerful inequalities of Talagrand [21], in the context of this problem. But
none of them yield anything close to what are believed to be the optimal
bounds. Similarly, the striking developments of Boucheron, Lugosi and Massart
[5], while successful in a wide range of concentration problems, could
not quite resolve the issue of the upper tail for subgraph counts.

The major breakthroughs in the 'upper tail problem' came with the works
of Kim and Vu [18] and Janson, Oleszkiewicz and Ruciński [14] in 2004. Kim
and Vu [18] showed that if $H$ is a triangle, then for any $\epsilon > 0$, there are
positive constants $C_1(\epsilon)$ and $C_2(\epsilon)$ such that whenever $p \ge n^{-1}\log n$,

\[ e^{-C_1(\epsilon) n^2p^2\log(1/p)} \le \mathbb{P}(X_H \ge (1+\epsilon)\mathbb{E}(X_H)) \le e^{-C_2(\epsilon) n^2p^2}. \]

At the same time, Janson et al. [14] proved a similar result for general $H$,
again with a gap of a factor $\log(1/p)$ between the exponents of the upper and lower bounds. (Note the
lower bound is easily obtained as the probability of a single clique containing 
all the extra triangles.) 
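
The clique heuristic in the parenthetical note can be made concrete with a small computation. The following is a minimal sketch, not part of the original argument: it assumes the excess $\epsilon\mathbb{E}(T)$ is supplied entirely by one planted clique on $m$ vertices, ignores triangles between the clique and the rest of the graph, and uses the hypothetical helper name `clique_lower_bound_exponent`. The cost of forcing all $\binom{m}{2}$ clique edges is $p^{\binom{m}{2}}$, so the exponent is $\binom{m}{2}\log(1/p)$, which is of order $n^2p^2\log(1/p)$.

```python
import math

def clique_lower_bound_exponent(n: int, p: float, eps: float):
    """Return (m, exponent) for the planted-clique heuristic.

    m is the smallest clique size whose C(m, 3) triangles cover the excess
    eps * E(T); exponent is C(m, 2) * log(1/p), i.e. minus the log of the
    probability that all C(m, 2) clique edges are present.
    """
    expected = math.comb(n, 3) * p**3   # E(T) for G(n, p)
    excess = eps * expected             # extra triangles to be supplied
    m = 3
    while math.comb(m, 3) < excess:
        m += 1
    exponent = math.comb(m, 2) * math.log(1.0 / p)
    return m, exponent

# Illustrative parameters (assumptions, not taken from the paper).
n, p, eps = 10_000, 0.01, 1.0
m, exponent = clique_lower_bound_exponent(n, p, eps)
scale = n**2 * p**2 * math.log(1.0 / p)
ratio = exponent / scale   # compare with the heuristic value eps**(2/3) / 2
print(m, round(ratio, 3))
```

With these parameters $m$ is close to $\epsilon^{1/3} np$, and the ratio is close to $\epsilon^{2/3}/2$, consistent with an exponent of order $n^2p^2\log(1/p)$.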

Kim and Vu's technique is based on a method developed in their earlier
works [17, 23], which in turn is a highly sophisticated version of the method
of martingale differences [19]. The method of Janson et al. builds on an
extension of a remarkable result of Alon [1] about the maximum number of
copies of a graph $H$ in a graph with a given number of edges.

Subsequently, improvements in certain regimes were obtained by Janson
and Ruciński [16], but the problem of exactly matching the upper and lower
bounds has remained open for the past six years. The following theorem
closes the issue for the count of triangles by proving that the lower bound
is sharp.

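Theorem 1.1. Let $T$ be the number of triangles in an Erdős-Rényi graph
$G(n,p)$. For each $\epsilon > 0$ there are positive constants $C_1(\epsilon)$, $C_2(\epsilon)$ and $C_3(\epsilon)$
such that whenever $C_1(\epsilon)n^{-1}\log n \le p \le C_2(\epsilon)$, we have

\[ \mathbb{P}(T \ge (1+\epsilon)\mathbb{E}(T)) \le e^{-C_3(\epsilon) n^2p^2\log(1/p)}. \]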

The proof does not involve martingales or counting arguments. It is somewhat
hard to decide on a nomenclature for the method; it may be tentatively
called a 'localization argument'. It is plausible that the technique may be
useful in a wider class of problems.

It must be mentioned that a couple of months after the first draft of this 
paper was posted on arXiv, DeMarco and Kahn [8] posted a different proof 
of the conjecture. The DeMarco-Kahn proof is shorter and gives a little 
more. 

It should also be mentioned that recently, a related problem has been
investigated for dense graphs (i.e. $p$ fixed and $n \to \infty$). The objective is to
find the exact constant $C(p,\epsilon)$ such that

\[ \mathbb{P}(T \ge (1+\epsilon)\mathbb{E}(T)) = e^{-n^2 C(p,\epsilon)(1+o(1))}. \]

In the language of large deviations, this is the problem of evaluating the large
deviation rate function. The first progress in this problem was made in [6]
where it was shown that given $p \in (0,1)$, there exist $p^3/6 < t' < t'' < 1/6$
such that for all $t \in (p^3/6, t') \cup (t'', 1/6)$,

\[ \mathbb{P}(T \ge tn^3) = e^{-n^2 I_p((6t)^{1/3})(1+o(1))}, \]

where $I_p$ is the function

\[ I_p(x) := \frac{x}{2}\log\frac{x}{p} + \frac{1-x}{2}\log\frac{1-x}{1-p}. \]

However the result does not cover all values of $(p,t)$. The rate function in
the full regime has been obtained very recently in [7] as a consequence of a
general large deviation principle for dense Erdős-Rényi graphs. Two other
papers in this direction are [4] and [9].



2. Notation and terminology 

In the following, $G$ will denote an Erdős-Rényi graph $G(n,p)$ and $T$ will
be the number of triangles in $G$. By convention, $C$ will denote any positive
absolute constant, whose value may change from line to line. Similarly,
$C(\epsilon)$ will denote any positive constant whose value depends only on $\epsilon$. For
simplicity of notation, set

\[ L := \log(1/p), \qquad \ell := 1/L. \]

Given $\epsilon$, $n$ and $p$, let us call an edge in $G$ 'good' if there are fewer than $\epsilon\ell np$
triangles containing the edge. Any edge that is not 'good' will be called
'bad'. In the same vein, call a vertex 'good' if it has fewer than $7np$ neighbors
in $G$, and 'bad' otherwise. (There is nothing special about 7; any large
enough constant is good for our purposes.) Define:

$T'$ := #triangles in $G$ with all good edges.

$T_0$ := #triangles in $G$ with at least one bad edge, but all good vertices.
$T_1$ := #triangles in $G$ with exactly one bad vertex and two good vertices.
$T_2$ := #triangles in $G$ with exactly two bad vertices and one good vertex.
$T_3$ := #triangles in $G$ with all bad vertices.

Then clearly,

\[ T \le T' + T_0 + T_1 + T_2 + T_3. \tag{1} \]

Thus, it suffices to get upper tail estimates for the summands on the right 
hand side. This is the program for the subsequent sections. Some further 
notation and terminology will be introduced along the way. 
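
To make the decomposition concrete, here is a minimal sketch that samples a small graph, classifies every triangle according to the definitions above, and checks inequality (1). The function name `classify_triangles` and the parameter values are illustrative assumptions. A triangle with all good edges and a bad vertex is counted in both $T'$ and one of $T_1, T_2, T_3$, which is why (1) is an inequality rather than an identity.

```python
import itertools
import math
import random

def classify_triangles(n, p, eps, seed=0):
    """Sample G(n, p) and count T and the five classes T', T0, T1, T2, T3."""
    rng = random.Random(seed)
    adj = [set() for _ in range(n)]
    for u, v in itertools.combinations(range(n), 2):
        if rng.random() < p:
            adj[u].add(v)
            adj[v].add(u)

    ell = 1.0 / math.log(1.0 / p)
    edge_threshold = eps * ell * n * p   # good edge: fewer triangles than this
    vertex_threshold = 7 * n * p         # good vertex: fewer neighbors than this

    def good_edge(u, v):
        # Triangles containing the edge uv = common neighbors of u and v.
        return len(adj[u] & adj[v]) < edge_threshold

    def good_vertex(u):
        return len(adj[u]) < vertex_threshold

    counts = {"T'": 0, "T0": 0, "T1": 0, "T2": 0, "T3": 0}
    total = 0
    for u, v, w in itertools.combinations(range(n), 3):
        if v in adj[u] and w in adj[u] and w in adj[v]:
            total += 1
            all_good = good_edge(u, v) and good_edge(v, w) and good_edge(u, w)
            bad = sum(not good_vertex(x) for x in (u, v, w))
            if all_good:
                counts["T'"] += 1
            elif bad == 0:
                counts["T0"] += 1
            if bad >= 1:
                counts[f"T{bad}"] += 1
    return total, counts

total, counts = classify_triangles(60, 0.3, 0.3)
print(total, counts)
```

For graphs this small, $7np$ exceeds $n - 1$, so every vertex is good and only $T'$ and $T_0$ are populated; bad vertices only become possible when $7np < n$.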



3. Concentration inequality 



The following theorem is the most crucial component of the proof. It gives a
generic concentration inequality for sums of dependent random variables.

Theorem 3.1. Let $F$ be a finite set and $(X_i)_{i \in F}$, $(X'_i)_{i \in F}$, $(X_j(i))_{i,j \in F}$ be
collections of nonnegative random variables with finite moment generating
functions, defined on the same probability space, and satisfying the following
conditions:

(a) For all $i$, $X_i \le X'_i$.

(b) For all $i$, the random variables $X'_i$ and $\sum_{j \in F} X_j(i)$ are independent.

(c) For all $i$, $\sum_{j \in F} X_j(i) \le \sum_{j \in F} X_j$.

(d) There is a constant $a$ such that for all $i$, when $X_i > 0$, we have

\[ \sum_{j \in F} X_j \le a + \sum_{j \in F} X_j(i). \]

Let $\lambda := \sum_{i \in F} \mathbb{E}(X'_i)$. Then for any $t \ge \lambda$,

\[ \mathbb{P}\Bigl(\sum_{i \in F} X_i \ge t\Bigr) \le \exp\Bigl(-\frac{t}{a}\Bigl(\log\frac{t}{\lambda} - 1 + \frac{\lambda}{t}\Bigr)\Bigr) \le \exp\Bigl(-\frac{t}{a}\log\frac{t}{3\lambda}\Bigr). \]

(Note that the second bound holds trivially for $0 < t < \lambda$ as well.)

It is a bit difficult to convey the meaning of this theorem. Hopefully, it
will become more transparent as the technique is applied numerous times
in the proof of Theorem 1.1. The following simple lemma is the first step in
the proof of Theorem 3.1.

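Lemma 3.2. Suppose $X$ is a nonnegative random variable with a finite
moment generating function and $\lambda$ and $a$ are positive constants such that
for any $\theta \ge 0$, we have

\[ \mathbb{E}(X e^{\theta X}) \le \lambda e^{a\theta}\, \mathbb{E}(e^{\theta X}). \]

Then for any $t \ge \lambda$ we have

\[ \mathbb{P}(X \ge t) \le \exp\Bigl(-\frac{t}{a}\Bigl(\log\frac{t}{\lambda} - 1 + \frac{\lambda}{t}\Bigr)\Bigr). \]

Proof. Let $m(\theta) := \mathbb{E}(e^{\theta X})$ be the moment generating function of $X$. From
the hypothesis of the lemma, we have

\[ \frac{d}{d\theta}\log m(\theta) \le \lambda e^{a\theta}. \]

Integrating, we get

\[ m(\theta) \le \exp\Bigl(\frac{\lambda(e^{a\theta} - 1)}{a}\Bigr). \]

Again, for any positive $\theta$ and $t$,

\[ \mathbb{P}(X \ge t) \le e^{-\theta t} m(\theta). \]

The proof is completed by taking $\theta = a^{-1}\log(t/\lambda)$. □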
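
As a sanity check on Lemma 3.2, consider a Poisson random variable with mean $\lambda$, which satisfies the hypothesis with equality and $a = 1$, since $\mathbb{E}(Xe^{\theta X}) = m'(\theta) = \lambda e^{\theta} m(\theta)$. The sketch below compares the exact Poisson upper tail with the bound of the lemma. The choice of the Poisson example and the helper names `lemma_bound` and `poisson_upper_tail` are assumptions made for illustration only.

```python
import math

def lemma_bound(t, lam, a=1.0):
    """exp(-(t/a) * (log(t/lam) - 1 + lam/t)), the bound of Lemma 3.2 for t >= lam."""
    return math.exp(-(t / a) * (math.log(t / lam) - 1.0 + lam / t))

def poisson_upper_tail(t, lam, terms=400):
    """P(X >= t) for X ~ Poisson(lam) and integer t >= 0, by direct summation.

    The series is truncated after `terms` terms; the omitted mass is negligible
    for the parameters used here and only makes the computed tail smaller.
    """
    def log_pmf(k):
        return -lam + k * math.log(lam) - math.lgamma(k + 1)
    return sum(math.exp(log_pmf(k)) for k in range(t, t + terms))

lam = 5.0
checks = [(t, poisson_upper_tail(t, lam), lemma_bound(t, lam))
          for t in (5, 8, 12, 20, 40)]
for t, tail, bound in checks:
    print(t, tail, bound, tail <= bound)
```

At $t = \lambda$ the bound equals 1, and it decays like $e^{-t\log(t/\lambda)}$ for large $t$, which is the source of the logarithmic factor in the later applications.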
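Proof of Theorem 3.1. Let $X := \sum_{i \in F} X_i$. Then for any $\theta \ge 0$, by
the condition (d) and the nonnegativity of the $X_i$'s, we have

\[ \mathbb{E}(X e^{\theta X}) = \sum_{i \in F} \mathbb{E}\bigl(X_i e^{\theta\sum_{j \in F} X_j}\bigr) \le e^{a\theta} \sum_{i \in F} \mathbb{E}\bigl(X_i e^{\theta\sum_{j \in F} X_j(i)}\bigr). \]

But again, due to the nonnegativity of $X_i$ and the conditions (a) and (b),
we have

\[ \mathbb{E}\bigl(X_i e^{\theta\sum_{j \in F} X_j(i)}\bigr) \le \mathbb{E}\bigl(X'_i e^{\theta\sum_{j \in F} X_j(i)}\bigr) = \mathbb{E}(X'_i)\, \mathbb{E}\bigl(e^{\theta\sum_{j \in F} X_j(i)}\bigr). \]

Finally, by condition (c) we get

\[ \mathbb{E}\bigl(e^{\theta\sum_{j \in F} X_j(i)}\bigr) \le \mathbb{E}(e^{\theta X}). \]

Combining the steps we get

\[ \mathbb{E}(X e^{\theta X}) \le \lambda e^{a\theta}\, \mathbb{E}(e^{\theta X}). \]

The proof of the first inequality is completed by applying Lemma 3.2. To
get the second (weaker) inequality, simply observe that

\[ \log\frac{t}{\lambda} - 1 + \frac{\lambda}{t} \ge \log\frac{t}{e\lambda} \ge \log\frac{t}{3\lambda}. \]

This completes the proof. □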

4. Tail bound for T' 

The objective of this section is to obtain a tail inequality for the number
of triangles with all good edges. Let us begin by introducing some notation,
beyond what has been already defined in previous sections. Let $V^{(2)}$ denote
the set of all unordered pairs of distinct vertices $uv$ and let $V^{(3)}$ denote the
set of all unordered triplets of distinct vertices $uvw$. For each $uv \in V^{(2)}$, let

\[ I_{uv} := 1_{\{uv \text{ is an edge in } G\}}. \]

Next, let

\[ Z_{uv} := 1_{\{uv \text{ is a good edge in } G\}}. \]

For each $uvw \in V^{(3)}$, let

\[ Y_{uvw} := 1_{\{uvw \text{ is a triangle in } G\}} = I_{uv} I_{vw} I_{uw}, \]

and let

\[ X_{uvw} := 1_{\{uvw \text{ is a triangle with all good edges}\}} = Y_{uvw} Z_{uv} Z_{vw} Z_{uw}. \]

Then note that

\[ T' = \sum_{uvw \in V^{(3)}} X_{uvw}. \]

We intend to apply Theorem 3.1 to the collection $(X_{uvw})$. We already have
$Y_{uvw}$ such that $X_{uvw} \le Y_{uvw}$, and therefore condition (a) is satisfied if we set
$X'_{uvw} = Y_{uvw}$. The challenge is now to construct $X_{xyz}(uvw)$ appropriately.
The rest of this section is devoted to the proof of the following result.

Proposition 4.1. There exists a positive constant $C(\epsilon)$ depending only on
$\epsilon$ such that

\[ \mathbb{P}(T' \ge \mathbb{E}(T) + \epsilon n^3p^3) \le e^{-C(\epsilon) n^2p^2\log(1/p)}. \]

Proof. Fix a triplet $uvw \in V^{(3)}$. Let $K$ be the set of triplets that share
no vertices with $uvw$ and let $K'$ be the set of triplets that share exactly
one vertex with $uvw$. Let $K''$ be the set of triplets that share two or more
vertices with $uvw$. Then $V^{(3)} = K \cup K' \cup K''$.

Take any $xyz \in K'$. Then $xyz$ shares exactly one vertex with $uvw$.
Suppose without loss of generality that $x = u$. Define the random variable
$X_{xyz}(uvw)$ as follows. First, let

$T_y$ := #triangles of the form $xyr$ where $r \notin \{v,w\}$,

$T_z$ := #triangles of the form $xzr$ where $r \notin \{v,w\}$.

Next, let

$N_y$ := #neighbors of $y$ in the set $\{v,w\}$,
$N_z$ := #neighbors of $z$ in the set $\{v,w\}$.

Define

\[ E_y := 1_{\{N_y + T_y < \epsilon\ell np\}}, \qquad E_z := 1_{\{N_z + T_z < \epsilon\ell np\}}. \]

Finally, let

\[ X_{xyz}(uvw) := Y_{xyz} Z_{yz} E_y E_z. \]

Now note that since $\{v,w\} \cap \{y,z\} = \emptyset$, the definitions of $Y_{xyz}$, $Z_{yz}$, $T_y$, $T_z$,
$N_y$ and $N_z$ do not involve the edges $uv$, $vw$, $uw$. In particular, the definition
of $X_{xyz}(uvw)$ does not involve the edges $uv$, $vw$, $uw$. This is true for any
$xyz \in K'$.

If $xyz \in K$, let $X_{xyz}(uvw) = X_{xyz}$. Then clearly the definition of $X_{xyz}(uvw)$
does not involve the edges $uv$, $vw$, $uw$.
Finally, if $xyz \in K''$, let $X_{xyz}(uvw) = 0$.

From the above construction and observations, we see that condition (b)
of Theorem 3.1 is satisfied.

Next, suppose that for some $xyz \in K'$ with $x = u$, we have $X_{xyz}(uvw) = 1$
in a particular realization of $G$. Then $xyz$ is a triangle, $yz$ is a good edge,
and $E_y = E_z = 1$. We claim that $xy$ is a good edge. To see this, simply note
that the number of triangles containing the edge $xy$ is bounded by $T_y + N_y$.
Similarly, $xz$ is also a good edge. Therefore $X_{xyz} = 1$. Thus, for $xyz \in K'$,
$X_{xyz}(uvw) \le X_{xyz}$. When $xyz \in K \cup K''$, the inequality is trivially true.
This establishes condition (c) of Theorem 3.1.

Finally, suppose that for some $xyz \in K'$ with $x = u$, we have $X_{xyz} = 1$
in a particular realization of $G$, which also satisfies $X_{uvw} = 1$. Then $xv$
and $xw$ are edges in $G$, and hence the number of triangles containing the
edge $xy$ is exactly equal to $N_y + T_y$. Therefore in this situation, $E_y = Z_{xy}$.
Similarly $E_z = Z_{xz}$. Therefore, $X_{xyz}(uvw) = Y_{xyz} Z_{xy} Z_{xz} Z_{yz} = X_{xyz}$. Thus,
when $X_{uvw} = 1$, we have $X_{xyz}(uvw) = X_{xyz}$ for all $xyz \in K'$. The same is
trivially true for $xyz \in K$.

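Moreover, if $X_{uvw} = 1$, the number of triangles in $G$ sharing at least two
vertices with $uvw$ is bounded by $3\epsilon\ell np$. Combining these observations, we
see that when $X_{uvw} = 1$,

\[ \sum_{xyz \in V^{(3)}} X_{xyz} \le 3\epsilon\ell np + \sum_{xyz \in V^{(3)}} X_{xyz}(uvw). \]

This establishes condition (d) of Theorem 3.1 with $a = 3\epsilon\ell np$. Thus, if we
let $\lambda := \sum_{xyz} \mathbb{E}(Y_{xyz}) = \mathbb{E}(T)$, then by the first inequality in Theorem 3.1
we get

\[ \mathbb{P}(T' \ge \lambda + \epsilon n^3p^3) \le \exp\Bigl(-\frac{\lambda + \epsilon n^3p^3}{3\epsilon\ell np}\bigl(\log c - 1 + c^{-1}\bigr)\Bigr), \]

where

\[ c = c(n,p,\epsilon) := \frac{\lambda + \epsilon n^3p^3}{\lambda}. \]

It is easy to check that $\log c - 1 + c^{-1}$ converges to a positive constant
depending only on $\epsilon$ as $n \to \infty$. This completes the proof. □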

5. Tail bound for $T_0$

The purpose of this section is to obtain a tail bound for the number of
triangles in $G$ with at least one bad edge but all good vertices. We shall
continue to use the notation and terminology introduced in the previous
sections. We also need to introduce some additional notation. For each
$uv \in V^{(2)}$, let

\[ t_{uv} := \sum_{w \in V \setminus \{u,v\}} I_{uw} I_{vw}. \]

For a set $F \subseteq V^{(2)}$, let

\[ t(F) := \sum_{uv \in F} t_{uv}. \]

Two elements of $V^{(2)}$ will be called 'non-adjacent' if they do not share a
common vertex, and 'adjacent' otherwise. A set $F \subseteq V^{(2)}$ will be called
a 'matching' if no two elements of $F$ share a common vertex.

Lemma 5.1. Let $A$ be a matching. Then for each $t \ge 0$,

\[ \mathbb{P}(t(A) \ge t) \le \exp\Bigl(-\frac{t}{3}\log\frac{t}{3|A|np^2}\Bigr). \]

Proof. Let $R$ be the set of triplets $uvw$ such that $uv \in A$ and $w \in V \setminus \{u,v\}$.
Throughout this proof, $uvw$ will denote a typical element of $R$. For each
such $uvw$, let

\[ W_{uvw} := I_{uw} I_{vw}. \]

Since $t(A) = \sum_{uvw \in R} W_{uvw}$, we intend to apply Theorem 3.1 to the collection
$(W_{uvw})$.

Fix $uvw \in R$. Let $D$ be the set of all $xyz \in R$ such that

\[ \{xz, yz\} \cap \{uw, vw\} \ne \emptyset. \tag{2} \]

This can happen if $uvw = xyz$. Suppose this is not the case. Then since $A$
is a matching, $\{u,v\} \cap \{x,y\} = \emptyset$. Thus if (2) holds and $uvw \ne xyz$, then
we must have $\{x,y\} \cap \{w\} \ne \emptyset$ and $\{u,v\} \cap \{z\} \ne \emptyset$. But again because $A$
is a matching, the choice of $x$ determines the choice of $y$. Combining the
above observations, we see that $|D| \le 3$.

Since $W_{uvw}$ depends only on the edges $uw$ and $vw$, $W_{uvw}$ is independent
of the collection $\{W_{xyz}\}_{xyz \in R \setminus D}$. So, if we define $W_{xyz}(uvw) = 0$ for $xyz \in D$,
$W_{xyz}(uvw) = W_{xyz}$ for $xyz \in R \setminus D$, and $W'_{xyz} = W_{xyz}$ for all $xyz \in R$,
then the conditions of Theorem 3.1 are satisfied with $a = 3$ and
$\lambda = \sum_{xyz \in R} \mathbb{E}(W_{xyz}) \le |A|np^2$. This completes the proof of the lemma. □

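Next, let us define two events:

$E_1 := \{\exists F \subseteq V^{(2)}$ such that $F$ is a matching, $|F| \ge Lnp$,
and all elements of $F$ are bad edges in $G\}$.
$E_2 := \{\exists F \subseteq V^{(2)}$ such that $F$ is a matching, $|F| \le Lnp$,
and $t(F) \ge \epsilon n^2p^2\}$.

Lemma 5.2. There is a constant $C(\epsilon) > 0$ depending only on $\epsilon$ such that
whenever $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$, we can conclude that both $\mathbb{P}(E_1)$ and
$\mathbb{P}(E_2)$ are bounded by $\exp(-C(\epsilon)n^2p^2\log(1/p))$.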

Proof. If $E_1$ holds, there exists a set $F$ satisfying the conditions for $E_1$.
By arbitrarily dropping some elements of $F$, we can find a subset $F'$ of $F$
such that $Lnp \le |F'| \le 2Lnp$. Then $F'$ is again a matching, and since the
elements of $F'$ are bad edges in $G$,

\[ t(F') \ge |F'|\epsilon\ell np \ge \epsilon n^2p^2. \]

By Lemma 5.1, we know that for any fixed matching $A \subseteq V^{(2)}$ of size $\le 2Lnp$,

\[ \mathbb{P}(t(A) \ge \epsilon n^2p^2) \le \exp\Bigl(-\frac{\epsilon n^2p^2}{3}\log\frac{\epsilon n^2p^2}{6Ln^2p^3}\Bigr). \]

Clearly, there exists $C(\epsilon) > 0$ such that if $p \le C(\epsilon)$, the expression on the
right is bounded by $e^{-C(\epsilon)n^2p^2\log(1/p)}$. The number of choices of $A \subseteq V^{(2)}$
with $|A| \le 2Lnp$ is bounded by

\[ \sum_{0 \le k \le 2Lnp} \binom{n(n-1)/2}{k} \le e^{CLnp\log n}. \]

Combining the above observations, we see that

\[ \mathbb{P}(E_1) \le e^{CLnp\log n - C(\epsilon)n^2p^2\log(1/p)}. \]

Thus, if $C'(\epsilon)^{-1}n^{-1}\log n \le p \le C'(\epsilon)$ for some appropriately small constant
$C'(\epsilon)$, we get the required bound.

To get the bound on $\mathbb{P}(E_2)$, we similarly apply Lemma 5.1 with fixed
$A \subseteq V^{(2)}$ of size $\le Lnp$ and $t = \epsilon n^2p^2$, and then take a union bound over all
choices of $A$. □

Finally we arrive at the main result of this section. 

Proposition 5.3. There is a constant $C(\epsilon) > 0$ depending only on $\epsilon$ such
that if $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$, we have

\[ \mathbb{P}(T_0 \ge 15\epsilon n^3p^3) \le e^{-C(\epsilon)n^2p^2\log(1/p)}. \]

Proof. Let $B'$ be the set of bad edges with both endpoints in the set of good
vertices. We shall now show that if $E_1$ and $E_2$ are both false, then
$t(B') \le 15\epsilon n^3p^3$. This will complete the proof of the proposition, since $T_0 \le t(B')$.

Recall that we call two elements of $B'$ 'adjacent' if they share a common
vertex in $G$. This defines an undirected graph structure on $B'$. Since the
degree of an endpoint of any element of $B'$ (in the graph $G$) is less than $7np$,
it is clear that the maximum vertex degree of the adjacency graph on $B'$ is
less than $14np$. Thus, there is a coloring of this graph with $\le 14np + 1 \le 15np$
colors such that no two adjacent elements of $B'$ receive the same color. (This
is a standard argument in graph theory: arrange the elements of $B'$ in some
arbitrary order; color the $i$th element with a color that was not given to any
of its neighbors among the first $i - 1$ elements. This produces a coloring
such that no two adjacent elements receive the same color, and it is also
clear that we need at most $14np + 1$ colors.)
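
The greedy coloring described in the parenthetical remark can be sketched in a few lines. This is an illustrative implementation on a small hand-picked edge set (the list `pairs` and the helper names are assumptions); it checks that every color class is a matching and that the number of colors is at most one more than the maximum degree of the adjacency graph.

```python
def greedy_color_pairs(pairs):
    """Greedily color vertex pairs so that pairs sharing a vertex differ in color."""
    color = {}
    for e in pairs:
        # Colors already used by earlier pairs adjacent to e.
        used = {color[f] for f in color if set(e) & set(f)}
        c = 0
        while c in used:
            c += 1
        color[e] = c
    return color

def is_matching(edge_list):
    """True if no vertex appears in two different pairs of edge_list."""
    endpoints = [x for e in edge_list for x in e]
    return len(endpoints) == len(set(endpoints))

# Small example: every vertex lies in at most 3 pairs, so each pair is
# adjacent to at most 2 * (3 - 1) = 4 others and at most 5 colors are needed.
pairs = [(0, 1), (0, 2), (0, 3), (1, 2), (2, 3), (3, 4), (4, 5), (1, 5)]
coloring = greedy_color_pairs(pairs)

classes = {}
for e, c in coloring.items():
    classes.setdefault(c, []).append(e)

print(classes)
```

In the proof the same count gives at most $14np + 1$ colors, because each endpoint of a pair in $B'$ has degree less than $7np$.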

Let us fix such a coloring. For each color $c$, let $F_c$ denote the subset of $B'$
that receives the color $c$. Note that each $F_c$ is a matching by construction.

Now take the color $c$ that maximizes $t(F_c)$. Then

\[ t(F_c) \ge \frac{t(B')}{\text{number of colors}} \ge \frac{t(B')}{15np}. \]

By the falsity of $E_1$ we have $|F_c| \le Lnp$; and therefore, the falsity of $E_2$
shows that $t(F_c) \le \epsilon n^2p^2$. Thus, $t(B') \le 15\epsilon n^3p^3$. This completes the
proof. □

6. Tail bound for $T_1$

In this section we bound the probability that there are too many triangles
with exactly one bad vertex and two good vertices. As usual, we will continue
to adhere to the notation and terminology introduced in the preceding
sections. Additionally, let us define the event

$E_3$ := {There are more than $Lnp$ bad vertices}.

For each $u \in V$, let $d_u$ be the degree of $u$ in $G$. For each $A \subseteq V$ let

\[ d(A) := \sum_{u \in A} d_u. \]
Lemma 6.1. Let $X$ be a binomial random variable with mean $\lambda$. Then for
any $t \ge 0$,

\[ \mathbb{P}(X \ge t) \le e^{-t\log(t/3\lambda)}. \]

Proof. Since $X$ is a binomial random variable, it can be expressed as $\sum_i X_i$
where the $X_i$ are i.i.d. Bernoulli random variables. Setting $X'_i = X_i$, and $X_j(i) = X_j$
if $j \ne i$ and $X_i(i) = 0$, we are in the setting of Theorem 3.1 with $a = 1$
and $\lambda = \mathbb{E}(X)$. This completes the proof. □
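
The bound of Lemma 6.1 is easy to compare against exact binomial tails. The following is a minimal numerical sketch with illustrative parameters (the values of `n`, `p` and the helper names are assumptions); for $t \le 3\lambda$ the bound is at least 1 and holds trivially, so the interesting cases are $t > 3\lambda$.

```python
import math

def binomial_upper_tail(n, p, t):
    """Exact P(X >= t) for X ~ Binomial(n, p) and integer t."""
    return sum(math.comb(n, k) * p**k * (1 - p)**(n - k)
               for k in range(t, n + 1))

def lemma_6_1_bound(t, lam):
    """The bound exp(-t * log(t / (3 * lam))) from Lemma 6.1."""
    return math.exp(-t * math.log(t / (3.0 * lam)))

n, p = 200, 0.05
lam = n * p   # mean of the binomial, here 10
rows = [(t, binomial_upper_tail(n, p, t), lemma_6_1_bound(t, lam))
        for t in (10, 20, 30, 40, 60)]
for t, tail, bound in rows:
    print(t, tail, bound, tail <= bound)
```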

Lemma 6.2. For any set of vertices $A$, and any $t \ge 0$, we have

\[ \mathbb{P}(d(A) \ge t) \le \exp\Bigl(-\frac{t}{2}\log\frac{t}{6n|A|p}\Bigr). \]

Proof. Let $Y$ be the number of distinct edges in $G$ with at least one endpoint
in $A$. A simple argument shows that $d(A) \le 2Y$. Note that $Y$ is a binomial
random variable with mean $\le n|A|p$. Thus by Lemma 6.1 the proof is
done. □

Lemma 6.3. There is an absolute constant $C$ such that if $p \ge C^{-1}n^{-1}\log n$,
we have

\[ \mathbb{P}(E_3) \le e^{-Cn^2p^2\log(1/p)}. \]

Proof. For any set of vertices $A$ with $|A| \le \lceil Lnp \rceil$, Lemma 6.2 gives that

\[ \mathbb{P}(d(A) \ge 7Ln^2p^2) \le \exp\Bigl(-\frac{7Ln^2p^2}{2}\log\frac{7Ln^2p^2}{6n\lceil Lnp \rceil p}\Bigr). \]

The number of choices of $A$ with $|A| \le \lceil Lnp \rceil$ is bounded by $e^{CLnp\log n}$.
Thus, if we define

\[ E_4 := \{\exists A \subseteq V \text{ with } |A| \le \lceil Lnp \rceil \text{ and } d(A) \ge 7Ln^2p^2\}, \]

then there are absolute constants $C'$ and $C''$ such that

\[ \mathbb{P}(E_4) \le \exp(C'Lnp\log n - C''Ln^2p^2). \]

So, if $p \ge C^{-1}n^{-1}\log n$ for some suitable constant $C$, we have

\[ \mathbb{P}(E_4) \le e^{-Cn^2p^2\log(1/p)}. \]

Next, note that if $E_3$ is true, then there is a set $A$ of vertices of size exactly
$\lceil Lnp \rceil$, each of which has degree $\ge 7np$. Consequently $d(A) \ge 7np\lceil Lnp \rceil \ge
7Ln^2p^2$. Thus, $E_3$ implies $E_4$. This completes the proof of the lemma. □

Proposition 6.4. There is a constant $C(\epsilon) > 0$ depending only on $\epsilon$ such
that whenever $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$, we have

\[ \mathbb{P}(T_1 \ge \epsilon n^3p^3) \le e^{-C(\epsilon)n^2p^2\log(1/p)}. \]
Proof. Fix a set $A \subseteq V$ of size $\le Lnp$. Let $R_1(A) \subseteq V^{(3)}$ be the set of
unordered triplets with exactly 1 vertex from $A$ and 2 vertices from $A^c$. Let
$T_1(A)$ be the number of triangles in $G$ with exactly 1 vertex from $A$ and 2
vertices from the set of good vertices in $A^c$.

A typical element of $R_1(A)$ will be written in the form $uvw$, where $u \in A$
and $vw$ is an unordered pair of vertices from $A^c$. For each $uvw \in R_1(A)$, let
$P_{uvw}$ be the indicator that $uvw$ is a triangle in $G$ and $v$, $w$ are good vertices.
Then

\[ T_1(A) = \sum_{uvw \in R_1(A)} P_{uvw}. \]

We shall first compute a tail bound for $T_1(A)$. We intend to apply Theorem 3.1
to the collection $(P_{uvw})_{uvw \in R_1(A)}$ for this purpose. As usual, let
$P'_{uvw} = Y_{uvw}$, where $Y_{uvw}$ is the indicator that $uvw$ is a triangle in $G$. This
clearly verifies condition (a) of Theorem 3.1.

Fix $uvw \in R_1(A)$, with $u \in A$ and $v, w \in A^c$. Let $K \subseteq R_1(A)$ be the set
of triplets that share no vertices with $uvw$ and let $K' \subseteq R_1(A)$ be the set of
triplets that share exactly one vertex with $uvw$.

Now take any $xyz \in K'$, with $x \in A$, $y, z \in A^c$. Then $xyz$ shares exactly
one vertex with $uvw$. Suppose the common vertex is in $A^c$. Then without
loss of generality, $y = v$. Define $P_{xyz}(uvw)$ to be the indicator that $xyz$ is a
triangle in $G$, that $z$ is a good vertex, and that the number of neighbors of
$y$ in $V \setminus \{u,w\}$ is $< 7np - 2$. If the common vertex is in $A$, that is $x = u$, let
$P_{xyz}(uvw) = P_{xyz}$.

If $xyz \in K$, let $P_{xyz}(uvw) = P_{xyz}$. If $xyz$ shares two or more vertices with
$uvw$, let $P_{xyz}(uvw) = 0$.

From the above construction, it is clear that the definitions of the random
variables $\{P_{xyz}(uvw)\}_{xyz \in R_1(A)}$ do not involve the edges $uv$, $vw$, $uw$, and
therefore this collection is independent of $P'_{uvw}$. This verifies condition (b)
of Theorem 3.1.

Another easy verification shows that whenever $P_{xyz}(uvw) = 1$, we must
have $P_{xyz} = 1$. Thus $P_{xyz}(uvw) \le P_{xyz}$. This establishes condition (c) of
Theorem 3.1.

Lastly, suppose that $P_{uvw} = 1$ in a particular realization of $G$. If $xyz \in K'$
and $P_{xyz} = 1$ in that realization of $G$, then it is easy to see that we also
have $P_{xyz}(uvw) = 1$. Moreover, since $P_{uvw} = 1$, there can be at most $21np$
triangles sharing two or more vertices with $uvw$. Thus, if $P_{uvw} = 1$, we have

\[ \sum_{xyz \in R_1(A)} P_{xyz} \le 21np + \sum_{xyz \in R_1(A)} P_{xyz}(uvw). \]

Therefore, we have established that all the conditions of Theorem 3.1 hold,
with $a = 21np$ and $\lambda = \sum_{xyz \in R_1(A)} \mathbb{E}(P'_{xyz}) \le n^2(Lnp)p^3$. This shows that
for any $A \subseteq V$ with $|A| \le Lnp$, we have

\[ \mathbb{P}(T_1(A) \ge \epsilon n^3p^3) \le e^{-C(\epsilon)Ln^2p^2}, \]

provided $p \le C'(\epsilon)$ for some small enough constant $C'(\epsilon)$.
Now, if $T_1 \ge \epsilon n^3p^3$, then either $E_3$ happens, or there exists a set $A \subseteq V$
(of bad vertices) with $|A| \le Lnp$ and $T_1(A) \ge \epsilon n^3p^3$. The number of choices
of such $A$ is bounded by $e^{CLnp\log n}$. Thus, there are constants $C$ and $C''(\epsilon)$
such that

\[ \mathbb{P}(T_1 \ge \epsilon n^3p^3) \le \mathbb{P}(E_3) + e^{CLnp\log n - C''(\epsilon)n^2p^2\log(1/p)}. \]

If $p \ge C(\epsilon)^{-1}n^{-1}\log n$ for some appropriately small constant $C(\epsilon)$, then we
can combine the above bound with the bound on $\mathbb{P}(E_3)$ from Lemma 6.3 to
complete the proof of the Proposition. □

7. Tail bound for $T_2$

In this section we bound the probability that there are too many triangles
with exactly two bad vertices and one good vertex. As usual, we will continue
to adhere to the notation and terminology introduced in the preceding
sections. We prove the following analog of Proposition 6.4. The method of
proof is similar.

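Proposition 7.1. There is a constant $C(\epsilon) > 0$ depending only on $\epsilon$ such
that whenever $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$, we have

\[ \mathbb{P}(T_2 \ge \epsilon n^3p^3) \le e^{-C(\epsilon)n^2p^2\log(1/p)}. \]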

Proof. Fix a set $A \subseteq V$ of size $\le Lnp$. Let $R_2(A) \subseteq V^{(3)}$ be the set of
unordered triplets with exactly 2 vertices from $A$ and 1 vertex from $A^c$. Let
$T_2(A)$ be the number of triangles in $G$ with exactly 2 vertices from $A$ and 1
vertex from the set of good vertices in $A^c$.

A typical element of $R_2(A)$ will be written in the form $uvw$, where $uv$ is
an unordered pair of vertices from $A$ and $w \in A^c$. For each $uvw \in R_2(A)$,
let $Q'_{uvw}$ be the indicator that $uw$ and $vw$ are edges in $G$, and let $Q_{uvw}$ be
the indicator that $uw$, $vw$ are edges in $G$ and $w$ is a good vertex. Then

\[ T_2(A) \le S(A) := \sum_{uvw \in R_2(A)} Q_{uvw}. \]

We intend to apply Theorem 3.1 to the collection $(Q_{uvw})_{uvw \in R_2(A)}$ to obtain
a tail inequality for $S(A)$. Clearly $Q_{uvw} \le Q'_{uvw}$ and hence condition (a) of
Theorem 3.1 is satisfied.

Now fix $uvw \in R_2(A)$, with $u, v \in A$ and $w \in A^c$. Let $K \subseteq R_2(A)$ be the
set of triplets $xyz$ such that $z \ne w$. Let $K'$ be the set of triplets $xyz \in R_2(A)$
such that $z = w$ but $\{u,v\} \cap \{x,y\} = \emptyset$. Let $K''$ be the set of $xyz \in R_2(A)$
such that $z = w$ and $\{x,y\} \cap \{u,v\} \ne \emptyset$. Note that $R_2(A) = K \cup K' \cup K''$.

Take any $xyz \in K'$. Define $Q_{xyz}(uvw)$ to be the indicator that $xz$ and
$yz$ are edges in $G$, and that the number of neighbors of $z$ in $V \setminus \{u,v\}$ is
$< 7np - 2$.

If $xyz \in K$, let $Q_{xyz}(uvw) = Q_{xyz}$. Lastly, if $xyz \in K''$, let $Q_{xyz}(uvw) = 0$.
As in the proof of Proposition 6.4, it is clear by construction that the
definitions of the random variables $\{Q_{xyz}(uvw)\}_{xyz \in R_2(A)}$ do not involve the
edges $uw$ and $vw$, and therefore this collection is independent of $Q'_{uvw}$. This
verifies condition (b) of Theorem 3.1.

Again, if $Q_{xyz}(uvw) = 1$, we must have $Q_{xyz} = 1$. Thus $Q_{xyz}(uvw) \le Q_{xyz}$,
which establishes condition (c) of Theorem 3.1.

Finally, suppose that $Q_{uvw} = 1$ in a particular realization of $G$. If $xyz \in K'$
and $Q_{xyz} = 1$ in that realization of $G$, then we also have $Q_{xyz}(uvw) = 1$.
Moreover, since $Q_{uvw} = 1$, we can easily conclude that the number of $xyz \in K''$
such that $Q_{xyz} = 1$ is bounded by $14np$. Thus, if $Q_{uvw} = 1$, we have

\[ \sum_{xyz \in R_2(A)} Q_{xyz} \le 14np + \sum_{xyz \in R_2(A)} Q_{xyz}(uvw). \]

Therefore, we have shown that all conditions of Theorem 3.1 hold, with
$a = 14np$ and $\lambda = \sum_{xyz \in R_2(A)} \mathbb{E}(Q'_{xyz}) \le n(Lnp)^2p^2$. This shows that for
any $A \subseteq V$ with $|A| \le Lnp$, we have

\[ \mathbb{P}(T_2(A) \ge \epsilon n^3p^3) \le \mathbb{P}(S(A) \ge \epsilon n^3p^3) \le e^{-C(\epsilon)Ln^2p^2}, \]

provided $p \le C'(\epsilon)$ for some small enough constant $C'(\epsilon)$.

We can now proceed exactly as in the last part of the proof of Proposition 6.4
to complete the proof by invoking Lemma 6.3. □



8. Tail bound for $T_3$

In this section we obtain a tail bound for the number of triangles with all
bad vertices. First, we need a simple lemma about the minimum number
of edges in a graph with a given number of triangles. The content of the lemma
can be derived as a corollary of Theorem A of Alon [1], but the result is so
simple that we give a direct proof.

Lemma 8.1. For any real symmetric matrix $(a_{ij})_{1 \le i,j \le n}$, we have

\[ \sum_{i,j=1}^n a_{ij}^2 \ge \Bigl(\sum_{i,j,k=1}^n a_{ij}a_{jk}a_{ki}\Bigr)^{2/3}. \]

Consequently, if an undirected graph has $r$ triangles, then it must have at
least $\frac{1}{2}(6r)^{2/3}$ edges.



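Proof. By two applications of the Cauchy-Schwarz inequality we get

\[
\sum_{i,j,k=1}^n a_{ij}a_{jk}a_{ki}
\le \Bigl(\sum_{i,j=1}^n a_{ij}^2\Bigr)^{1/2} \Bigl(\sum_{i,j=1}^n \Bigl(\sum_{k=1}^n a_{jk}a_{ki}\Bigr)^2\Bigr)^{1/2}
\le \Bigl(\sum_{i,j=1}^n a_{ij}^2\Bigr)^{1/2} \Bigl(\sum_{i,j=1}^n \sum_{k,l=1}^n a_{jk}^2 a_{il}^2\Bigr)^{1/2}
= \Bigl(\sum_{i,j=1}^n a_{ij}^2\Bigr)^{3/2}.
\]

This completes the proof of the first assertion of the lemma. For the
graph theoretic conclusion, simply take $(a_{ij})$ to be the adjacency matrix of the
graph, so that $\sum_{i,j} a_{ij}^2$ is twice the number of edges and
$\sum_{i,j,k} a_{ij}a_{jk}a_{ki}$ is six times the number of triangles. □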
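
The graph-theoretic conclusion of Lemma 8.1 can be checked by brute force on small graphs. The sketch below is illustrative only (the helper names and the choice of test graphs are assumptions): it counts edges and triangles directly and compares the edge count with $\frac{1}{2}(6r)^{2/3}$.

```python
import itertools

def edges_and_triangles(n, edges):
    """Return (number of edges, number of triangles) of a graph on range(n)."""
    adj = [set() for _ in range(n)]
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    e = sum(len(s) for s in adj) // 2
    r = sum(1 for u, v, w in itertools.combinations(range(n), 3)
            if v in adj[u] and w in adj[u] and w in adj[v])
    return e, r

def min_edges_for_triangles(r):
    """Lower bound (1/2) * (6r)^(2/3) on the number of edges, from Lemma 8.1."""
    return 0.5 * (6 * r) ** (2.0 / 3.0)

# Complete graph K_8: 28 edges and 56 triangles.
complete = list(itertools.combinations(range(8), 2))
e, r = edges_and_triangles(8, complete)

# A 5-cycle has no triangles, so the bound is vacuous there.
cycle = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
e_cycle, r_cycle = edges_and_triangles(5, cycle)

print(e, r, min_edges_for_triangles(r))
```

Complete graphs come close to the bound, which is consistent with cliques being the cheapest way to create many triangles.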

Proposition 8.2. There is a constant $C(\epsilon) > 0$ depending only on $\epsilon$ such
that whenever $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$, we have

\[ \mathbb{P}(T_3 \ge \epsilon n^3p^3) \le e^{-C(\epsilon)n^2p^2\log(1/p)}. \]

Proof. Take any set $A \subseteq V$ with $|A| \le Lnp$. Let $T_3(A)$ be the number of
triangles with all vertices in $A$. If $T_3(A) \ge \epsilon n^3p^3$, then by Lemma 8.1 the
number of edges in $A$ must be $\ge C(\epsilon)n^2p^2$. The probability of this event is
bounded by

\[ \binom{|A|(|A|-1)/2}{C(\epsilon)n^2p^2}\, p^{C(\epsilon)n^2p^2}. \tag{3} \]

For any two integers $1 \le b \le a$, it is simple to see that

\[ \binom{a}{b} \le \frac{a^b}{b!} \le \Bigl(\frac{ea}{b}\Bigr)^b. \]

Applying this bound, we see that the quantity (3) is bounded by

\[ \exp\bigl(C(\epsilon)n^2p^2 + C'(\epsilon)n^2p^2\log L - C''(\epsilon)n^2p^2\log(1/p)\bigr), \]

where $C(\epsilon)$, $C'(\epsilon)$ and $C''(\epsilon)$ are constants depending only on $\epsilon$. If $p$ is
sufficiently small (depending on $\epsilon$), the above quantity is less than $e^{-C(\epsilon)n^2p^2\log(1/p)}$.
The proof is now completed by using Lemma 6.3 as in the last part of the
proof of Proposition 6.4. □



9. Proof of Theorem 1.1

Combining Propositions 4.1, 5.3, 6.4, 7.1, 8.2 and the inequality (1), it
is easy to see that there is a constant $C(\epsilon) > 0$ depending only on $\epsilon$, such
that for $C(\epsilon)^{-1}n^{-1}\log n \le p \le C(\epsilon)$,

\[ \mathbb{P}(T \ge \mathbb{E}(T) + 19\epsilon n^3p^3) \le e^{-C(\epsilon)n^2p^2\log(1/p)}. \]

This completes the proof.